Mapping Morphological and Phonetic Features of Catalan: a General Template for Contemporary Atlases and Corpus
Abstract
Regarding geolinguistics in Catalonia, three general observations can be made: a) no new initiatives to create a general linguistic atlas are expected; on the contrary, the tendency is to create regional or local atlases or, setting cartography aside, to develop monographs on several linguistic aspects of a particular dialectal area; b) there is no perceived need for an electronic publication of the atlas or for the release of an internet version (the usual format is paper); and c) there is a possibility of computerising the data contained in old atlases. The main aim of this paper is to describe the processes of systematisation and mapping of dialectal data based on “La flexió verbal en els dialectes catalans”. The paper is structured in five parts: a) the corpus of morphological and phonetic data; b) mapping the data; c) using the program; d) sound maps; e) conclusions.
Similar resources
ClInt: a Bilingual Spanish-Catalan Spoken Corpus of Clinical Interviews
In this paper we present ClInt (Clinical Interview), a bilingual Spanish-Catalan spoken corpus that contains 15 hours of clinical interviews. It consists of audio files aligned with multiple-level transcriptions comprising orthographic, phonetic and morphological information, as well as linguistic and extralinguistic encoding. This is a previously non-existent resource for these languages and i...
Frequency analysis of phonetic units for concatenative synthesis in Catalan
Knowledge of phonetic unit frequency is very necessary for developing databases in both concatenative synthesis and continuous speech recognition. In the present work, a large corpus of text was processed and phonetically transcribed to obtain allophone and diphone frequencies for the Catalan language. The corpus was acquired from newspaper articles, in which there were a lot of foreign words t...
Catalan Geolinguistics and New Technical Procedures
New technologies are helping researchers to apply new methods to the treatment of dialectal data, accomplishing a variety of research objectives in the stages of data compilation, data processing and the presentation of results. In this regard, dialectology has at least two aspects: a) obtaining new data to learn contemporary linguistic variation; and b) retrieving earlier material to facilitat...
A Phonetic-Based Approach to Chinese Chat Text Normalization
Chatting is a popular communication media on the Internet via ICQ, chat rooms, etc. Chat language is different from natural language due to its anomalous and dynamic natures, which renders conventional NLP tools inapplicable. The dynamic problem is enormously troublesome because it makes static chat language corpus outdated quickly in representing contemporary chat language. To address the dyna...
The Assessment of Pragmatic Knowledge in the Online General IELTS-Practice Resources: A Corpus Analysis of Writing Tasks
Motivated by the concept of Communicative Language Ability and the eminence of the IELTS exam, this study intended to scrutinize the representation of functional knowledge (FK) and socio-linguistic knowledge (SK) as sub-components of pragmatic knowledge in the writing performances of both tasks of the online General IELTS-practice resources across three band scores. This quantitative inter-scor...